Amazon launches AI feature Lens Live, enabling real-time item scanning via phone camera for quick product matching and purchase. Currently iOS-only, it supports mobile scanning and specific item recognition using advanced object detection technology.
Traycer, a VSCode AI assistant, enhances coding with task breakdown, multi-agent collaboration, and real-time error detection. It offers a 14-day trial and excels in large codebases.
The TEN Agent team recently announced that its core models **TEN Voice Activity Detection (VAD)** and **TEN Turn Detection** are now open source, providing powerful technical support for building real-time, multimodal speech AI agents. This move marks a significant advancement in the TEN framework's efforts to promote the democratization and open-source collaboration of speech interaction technology. The following is the latest information compiled by AIbase, offering an in-depth analysis of these two core models.
RF-DETR is an open-source, real-time object detection model that allows for the detection of objects within images and videos. It is commercially usable, making it a versatile tool for various applications.
RF-DETR is a real-time object detection model developed by Roboflow.
AI-generated audio watermarking technology
Real-time end-to-end object detection model
Real-time open vocabulary object detection
LeviDeHaan
SecInt is a SmolLM2-360M model fine-tuned for real-time nginx security log classification. It aims to automatically detect security threats, errors, and normal traffic patterns in web server logs with an accuracy of over 99%, and it is able to perform real-time detection on the CPU.
FluidInference
CoreML Silero VAD is a CoreML implementation of the Silero Voice Activity Detection (VAD) model. It is optimized for Apple platforms (iOS/macOS) and provides real-time voice activity detection capabilities.
trentmkelly
A binary text classification model for detecting AI-generated content in Reddit comments, supporting real-time detection in browser extensions.
TEN-framework
TEN VAD is a low-latency, lightweight, and high-performance streaming voice activity detection system, suitable for real-time voice processing scenarios.
nvidia
Lightweight multilingual voice activity detection model supporting six languages (Chinese, English, French, German, Russian, Spanish) with only 91.5K parameters, suitable for real-time speech processing scenarios
ustc-community
D-FINE is a real-time object detection model that achieves exceptional localization accuracy by redefining the bounding box regression task in the DETR model.
D-FINE is a powerful real-time object detection model that achieves excellent localization accuracy by redefining the bounding box regression task in the DETR model.
D-FINE is a real-time object detection model that achieves exceptional localization accuracy by redefining the bounding box regression task.
jameslahm
YOLOE is a real-time visual omni-model that supports various vision tasks including zero-shot object detection.
YOLOE is a real-time visual omni-model that combines object detection and visual understanding capabilities, suitable for various visual tasks.
YOLOE is a zero-shot object detection model capable of detecting various objects in visual scenes in real-time.
YOLOE is an efficient, unified, and open model for object detection and segmentation, supporting various prompting mechanisms including text, visual inputs, and prompt-free paradigms, achieving real-time universal visual perception.
D-FINE is a real-time object detection model based on an improved DETR architecture, achieving exceptional localization accuracy by redefining the bounding box regression task.
PekingU
RT-DETRv2 is an improved real-time object detection model based on the DETR architecture, optimizing detection performance through innovations like selective multi-scale feature extraction and dynamic data augmentation.
RT-DETRv2 is an improved real-time object detection Transformer model that enhances performance through strategies such as selective multi-scale feature extraction and dynamic data augmentation.
RT-DETRv2 is an improved version of the real-time object detection Transformer model, enhancing performance through multi-scale feature extraction and optimized training strategies.
RT-DETRv2 is an optimized real-time object detection model based on the RT-DETR architecture. It improves detection accuracy while maintaining real-time performance through selective multi-scale feature extraction and enhanced training strategies.
Vombit
A lightweight Counter-Strike 2 player detection model based on YOLOv11, suitable for real-time object detection scenarios
WireMCP is an MCP server that provides real-time network traffic analysis capabilities for large language models (LLMs). It supports data capture, threat detection, and network diagnosis by integrating the Wireshark tool.
A Structurizr DSL syntax debugging tool designed specifically for the Cursor IDE, providing real-time error detection, fix suggestions, and browser integration functions.
The Dynatrace MCP Server is a remote service that allows developers to interact with the Dynatrace observability platform, integrating real-time monitoring data directly into the development workflow. It supports functions such as problem detection, log query, and security vulnerability analysis.
The Volume Wall Detector MCP is a stock trading volume analysis server based on the Model Context Protocol (MCP), providing functions such as real-time trading volume analysis, price level detection, and trading imbalance tracking, and supporting connections with multiple MCP clients.
A professional SonicWall log analysis and threat detection MCP server that supports natural language queries of firewall logs, provides real-time threat monitoring and intelligent security analysis, and is compatible with SonicOS versions 7.x and 8.x.
Asterisk MCP Server is a middleware service that connects IDEs/code editors with Asterisk's security API through the Model Context Protocol (MCP) to provide real-time code security scanning, including code snippet detection, codebase analysis, and change verification.
The YOLO MCP Service is a powerful computer vision service that integrates with Claude AI through the Model Context Protocol (MCP), providing functions such as object detection, segmentation, classification, and real-time camera analysis.
The Volume Wall Detector MCP Server is a stock trading analysis tool based on the Model Context Protocol, providing functions such as real-time trading volume analysis, price level detection, and trading imbalance tracking, and supporting access from multiple MCP clients.
JADX-MCP-SERVER is a Python server that works with the JADX-AI-MCP plugin, enabling LLMs (such as Claude) to analyze decompiled Android APK code in real time through MCP, providing reverse engineering functions such as vulnerability detection and code understanding.
sec-mcp is a Python security detection toolkit that provides security check functions for domains, URLs, IPs, etc. It supports integration with Python applications, use in the terminal CLI, or running as an MCP server to provide real-time threat insights for LLMs.
This project implements a stock market analysis server based on MCP, providing functions such as real-time stock data acquisition, calculation of technical indicators (such as moving averages and RSI), and trend detection. It can be integrated with the AlphaVantage API and LLMs to assist in financial decision-making.
Enterprise-level authentication management MCP server, providing multi-protocol authentication, real-time threat detection, and encrypted credential management functions
TEN Agent is a multi-functional AI agent framework that integrates real-time vision, voice recognition, and screen sharing detection capabilities, and supports rapid expansion and development.